Equal sentence signature of a certain minimum length encourages us to compare the corresponding sentences. Her we look for equal signatures of minimum length 9.
The tables will show whether similar sentence signatures give rise to similar sentences in a particular language.
The results should be more impressive for larger corpora.
For the most frequent signature:
select @sig:=sig, @cnt:=anz from (select signature_untok as sig, count(*) as anz from para_s where length(signature_untok)>9 group by signature_untok order by anz desc limit 1) aa;
4.7.1.1 Most Frequent Sentence Signatures